Asymptotically Efficient Adaptive Strategies in Repeated Games Part II. Asymptotic Optimality

نویسندگان

  • Nahum Shimkin
  • Adam Shwartz
چکیده

Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at . http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Asymptotically Efficient Adaptive Strategies in Repeated Games Part I: Certainty Equivalence Strategies

This paper addresses the problem of dynamic decision making in an uncertain and competitive environment. A decision maker (player 1) faces a system about which he has some (parametric) uncertainty, and which is affected also by the actions of other agents. We focus on a worst-case analysis from the viewpoint of player 1, using the simplified model of a repeated matrix game with lack of informat...

متن کامل

Asymptotically Efficient Adaptive Strategies in Repeated Games. Part I: Certainty Equivalence

Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at . http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive...

متن کامل

Subsolutions of an Isaacs Equation and Efficient Schemes for Importance Sampling

It was established in [6, 7] that importance sampling algorithms for estimating rare-event probabilities are intimately connected with two-person zero-sum differential games and the associated Isaacs equation. This game interpretation shows that dynamic or state-dependent schemes are needed in order to attain asymptotic optimality in a general setting. The purpose of the present paper is to sho...

متن کامل

Finite Time Analysis of Optimal Adaptive Policies for Linear-Quadratic Systems

We consider the classical problem of control of linear systems with quadratic cost. When the true system dynamics are unknown, an adaptive policy is required for learning the model parameters and planning a control policy simultaneously. Addressing this trade-off between accurate estimation and good control represents the main challenge in the area of adaptive control. Another important issue i...

متن کامل

Asymptotic Optimality of the Bayes Estimator on Differentiable in Quadratic Mean Models

This paper deals with the study of the Bayes estimator’s asymptotic properties on Differentiable in Quadratic Mean (DQM) models in the case of independent and identically distributed observations. The investigation is led in order to define weak assumptions on the model under which this estimator is asymptotically efficient, regular and asymptotically of minimal risk. The results of the paper a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Math. Oper. Res.

دوره 21  شماره 

صفحات  -

تاریخ انتشار 1996